Interactive Agents to Overcome Ambiguity in Software Engineering
https://arxiv.org/abs/2502.13069
#GrahamNeubig
In this work, we study the ability of LLM agents to handle ambiguous instructions in interactive code generation settings by evaluating proprietary and open-weight models on their performance across three key steps: (a) leveraging interactivity to improve performance in ambiguous scenarios, (b) detecting ambiguity, and (c) asking targeted questions.
A benchmark in which agents ask questions in response to ambiguous instructions